Resolution of Identity Crisis of Events in Pile-up 



Avinash A. Deshpande & Harsha Raichur 
Raman Research Institute, C. V. Raman Avenue, Sadashivanagar, Bangalore 560080, India 



Abstract 

Mutually uncorrelated random discrete events, manifesting a common basic process, are examined 
often in terms of their occurrence rate as a function of one or more of their distinguishing attributes, such 
as measurements of photon spectrum as a function of energy. Such rate distributions obtained from the 
observed attribute values for an ensemble of events will correspond to the "true" distribution only if the 
event occurrence were mutually exclusive. However, due to finite resolution in such measurements, the 
problem of event pile-up is not only unavoidable, but also increases with event rate. Although extensive 
simulations to estimate the distortion due to pile-up in the observed rate distribution are available, no 
restoration procedure has yet been suggested. Here we present an elegant analytical solution to recover the 
underlying true distribution. Our method, based on Poisson statistics and Fourier transforms, is shown 
to perform as desired even when applied to distributions that are significantly distorted by pile-up. Our 
recipes for correction, as well as for prediction, of pile-up are expected to find ready applications in a 
wide variety of fields, ranging from high-energy physics to medical clinical diagnostics, and involving, but 
not limited to, measurements of count-rates and/or spectra of incident radiation using Charge Coupled 
Devices (CCDs) or other similar devices. 



To formulate the pile-up problem analytically, 
let us denote the true and apparent event-rate dis- 
tribution by X(S) and X a (S) respectively, where S 
is a chosen attribute of the events we are inter- 
ested in. Let A be the (spatio-temporal) resolu- 
tion with which these measurements are made. The 
basic situation in which 'pile-up' occurs can be il- 
lustrated using the following simple example. Con- 
sider two events (say, with S = Si & S2) that occur 
within the measurement resolution A. Since both 
the events occur within A they would be mistaken 
for a single event with an attribute S sum — S± + S2- 
Therefore, instead of registering two events, one 
each at S = Si and S = S2, only one event at 
S = S sum would be noted. As a result, the rate of 
events at Si and at £2 are underestimated, with a 
corresponding overestimation of event rate at S SU m ■ 

Generalizing the above example, if n events 
(say at, Si, i = 1 —> n; n > 1) were to overlap, 
then for each such occurrence, the event count at 
S S um — Sj=i Si is not only wrongly incremented 
by one, but a count is missed at each Si. Con- 
sequently, in addition to this mistaken identity in 

even the net event count suffers a deficit of n-1, 
for each unresolved occurrence of n events. Thus, 



in general, any measured X a (S) deviates from the 
corresponding true distribution \(S) due to a finite 
probability of unresolved events occurring within 
A. It should be noted here that, even though 
the total rate of events is thus underestimated, the 
rate-weighted integral of S remains conserved, that 
is 

< (1) 
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Let us now consider events of a given fixed attribute 
value So and examine the probability of occurrence 
of one or more of such events. Since the mutually 
independent discrete events are expected to follow 
Poisson statistics, the probability of, in general, k 
(> 0) such events occurring within the resolution 
A, is given by the function 
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where A is again the mean rate of events, or more 
specifically, the average number of events expected 
per A. Thus ki events, each with same attribute 
value S{, occurring within A would be unresolved, 



1 



and hence, they together would be mistaken for 
one event of strength Sf — faiSj. Thus, even 
though the true rate distribution is non-zero only 
at S = Si, the apparent probability density distri- 
bution (PDD) spreads to all non-negative integral 
multiples of Si when events are viewed with reso- 
lution A, as 

Pt(S? = k i S i )= [ ^>Ll (4 ) 

with an implicit maximum event count of one per 
A. 



In general, for an ensemble of mutually inde- 
pendent discrete events with a range of the true 
attribute values (say, Si, i = 1 — > N), the resul- 
tant PDD across the apparent attribute value S a 
(= YliLi wou ld be a grand convolution of the 
apparent PDDs (as in equation|4]) corresponding to 
each of the respective apparent value S?. 



Thus, 
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Substituting this result in equation [51 we get 
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where P(f) = E 4 =i A (^) e~^ Sif , which is the 
Fourier transform of the true distribution. 
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where (^) denotes convolution product, and again, 
ki is the number of events with true attribute Si, 
occurring within A. 

If P a (f) & Pi a (f) are the Fourier transforms 
of P a (S a ) & P?(S?) respectively, the convolution 
theorem would relate them as follows, 

N 

^ a (/)=n^ a (/) ( 6 ) 

8=1 

where Y[ denotes a simple product. 

The Fourier transforms appearing in the prod- 
uct on the right-hand side of the above equation, 
i.e. P i a (f), can be obtained in general for any i, 
by summing over ki the Fourier contribution from 
each of the components of P°"(Sf) (see equation 
E}, evaluated at the discrete values of Sf = faS 1 ,. 



By taking natural logarithm of both sides and 
rearranging, we obtain 



In 



N 

(>(/)) = P(f) E A (^) ( 9 ) 



This relation between the Fourier transforms of 
the true and the apparent distributions of events, 
should enable recovery of the underlying true rates 
or counts of events, as a function of a chosen at- 
tribute, from the corresponding observed distribu- 
tion, often distorted due to pile-up. 

The suggested recipe is 



1. One begins with measurements over a total 
number of, say, M independent resolution cells, 
each of size A, providing a record of discrete 
events (numbering, say, N c , where N c < M). 
Using such data, the events are sorted and 
counted according to the apparent value of 
their chosen attribute (S a ). The sorted event- 
count distribution, say C a (S a ), is normalized 
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Figure 1: Illustration of a distribution affected by pile-up (top panel; red) which is obtained through Monte- 
Carlo simulations, from an assumed model for a "true" distribution (top panel; green). The resultant 
distribution after pile-up correction (top panel; black) matches the model "true" distribution, within the 
statistical uncertainties. The bottom panel shows the amount of estimated correction (bottom panel; black), 
the difference with the recovered and the model distributions (bottom panel; green) , along with a smoothened 
version of the latter (bottom panel; red). 



by M to obtain an apparent probability den- 
sity distribution (PDD) of events across S a , 
that is P a (S a ) = C a (S a )/M. The PDD value 
at S a = 0, if not known a priori or explicitly, 
can be estimated trivially and is > 0, such that 
the total probability (including that at S a = 0) 
equals unity. 

2. P a (S a ) is Fourier transformed to obtain a 
so-called characteristic function, P a {f), but 
avoiding any normalization by the number of 
points (N) transformed. 

3. This crucial step involves computing X(f) 



In (p°(/)), such that if P a (f) = a(f)e j ^\ 
then X(f) = ln(a(/)) + j<f>(f), where j = 

4. Inverse Fourier transforming X(f) (now with 
usual normalization by N) gives the true event- 
rate distribution X(S) across the S range, 
along with a dip at S = whose magnitude 
is Eti ^ for Si ? 0. 

5. At S = 0, one may ignore this (dip) contri- 
bution completely, or compare its magnitude 
with the integral over the rest of the S'-rangc 
to assess internal consistency. The X(S) thus 
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obtained is multiplied by M to get the esti- 
mate of true distribution of event counts C(S), 
or further divided by A to get the underlying 
event-rate distribution in relevant basic units 
(such as per unit area and/or per unit time). 



It is important to satisfy the constraint, inte- 
gral of P a (S a ) being equal to unity, in general, and 
also to ensure that the rate-weighted integral of S 
is conserved as desired (see Equation [2J through 
the above restoration procedure. It is easy to show 
from Equation [5] that 
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where the first derivatives of P and P a with re- 
spect to /, when evaluated at / = 0, correspond 
to respective rate- weighted integrals of S, or to the 
first moments of the respective distributions. In 
contrast with the above, the true event-rate dis- 
tribution X(S) is not expected to follow any such 
constraint on its integral. 

Figure 1 illustrates application of our proce- 
dure, and its result. For simplicity, and without 
loss of generality, all distributions have been binned 
with ^-interval of unity. The level of correction ef- 
fected by the procedure is clearly evident on the 
left-side part of the distribution, where this home- 
coming of counts is accompanied by correspond- 
ing deportation out of the right-side region. In the 
present example, the restored count totals to about 
1.7 million (consistent with the original model dis- 
tribution), of which about 32% was lost due to pile- 
up. Although the original distribution is confined 
to attribute values S < 900, the apparent distri- 
bution extended well beyond due to pile-up, with 
about 45000 counts in the range S > 900. We note 
that the restored distribution is seen also confined 
to S < 900, and at larger S, any deviation now 
from the expected count (i.e. zero) is found to be 
well within statistical uncertainties. 

It is important to emphasize that the above 
procedure for pile-up correction works equally well 
also for two-sided distributions. Note that when 
the distribution P a (S a ) is one-sided (either S > 0, 
as in Figure 1, or S < 0), the real and imaginary 



parts of P a (f) represent a Hilbert pair, and so do 

the corresponding parts of In LP°(/)J, consistent 

with the recovered P(S) also being one-sided. Also, 
the derived relation (Eq. [5]) provides a direct way 
for predicting a piled-up distribution, if the true 
distribution is known. 



Applications and Discussion 



Pile-up effects have been encountered and discussed 
in a wide variety of contexts and measurements 
over the past several decades, as apparent from the 
non-exhaustive list below. 



1. In X-ray astronomy (e.g. the Suzaku 16 1 
and the Chandra jj| missions), particularly 
where CCDs or similar detectors are employed 
for measuring (photon) energy spectra, apart 
from imaging, of celestial sources. 

2. In high-energy physics experiments 0] (e.g. us- 
ing the Large Hadron Collider at CERN, in- 
cluding the on- going hunt for Higgs bosonQ), 
particle detectors are equipped with triggers to 
evaluate interaction among high energy parti- 
cles which are being studied. Cosmic ray de- 
tectors are also equipped with similar triggers 
which are enabled when cosmic rays of suffi- 
ciently high energy enter the Cherenkov de- 
tectors 0- In both contexts, fake triggers can 
occur due to pile-up of multiple events of lower 
energy, and on the other hand, using higher 
thresholds (to reduce such ghost or phantom 
particles) can "miss" to detect a real particle. 

3. In radiation measurement application and/or 
in Gamma spectroscopy etc., using solid state 



detectors (e.g. Si(Li); NaI(Tl))@, E 



11] 



4. In medical clinical diagnostics 0, H, [IH such 
as radio-nuclide therapy dosimetry imaging, 
micro-dosimetry of inhaled a— emitters (e.g. 
in measurements of specific energy spectra of 
epithelial cells of bronchiolar airways) and car- 
diac first-pass imaging, using Gamma Cam- 
eras. 
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5. In neutrino mass determination, using micro- 
calorimeter to measure the entire spectrum of 
187 Re (MARE experiment [H). 



The prevailing approaches to tackle the pile-up is- 
sue are: a) use as high a spatio-temporal resolu- 
tion as possible, b) reduce rate of events, if con- 
trollable, or c) restrict to regions with significantly 
reduced event rates, thus ignoring potentially valu- 
able data from regions that would be rich in events. 
The latter two result in poor statistics, compro- 
mising sensitivity of measurements. In the ab- 
sence of any correction procedure so far, iterative 
procedures [3, [T3, EH U| to seek the underlying 
"true" distribution are in use, wherein Monte-Carlo 
simulations of events, following assumed models of 
the "true" distribution and of pile- up, are employed 
to obtain simulated apparent distributions that are 
compared with those measured. 

The simple relation derived by us (Eq. ^ and 
the correction procedure presented here should find 
ready applicability, in the above mentioned and 
other relevant areas, for recovering the underlying 
true distribution of events, even when the observed 
distributions have significant distortion due to pile- 
up. This, in turn, would enable such measurements 
with significantly improved sensitivity, and detec- 
tion of features or events otherwise masked by the 
pile-up distortion. 
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